DETAILED ACTION

1. This office action is in response to the amendment filed on Dec. 23, 2008.

2. Claims 1, 7, 13, and 19 are amended.

3. Claims 1, 5-8, 11-13, 17-19, 23 and 24 are pending in this application.



Claim Rejections - 35 USC § 103 



4. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the
manner in which the invention was made. 

5. Claims 1, 5-8, 11-13 and 17-18 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Polak (U.S. Patent No. 6,226,627 B1 hereinafter "Polak"), in view of Li (U.S. Patent No.
6,625,500 B1 hereinafter "Li").

Per claims 1 and 13 
Polak discloses 

A method of instructing a computer program to self-optimize, said method comprising: 
■ inputting a selection command that selects one function from a list of pre-selected
functions for input into said computer program at a point of choice determined by a
programmer (col. 7 lines 54-59 "If multiple actions are enabled, the dependency action
system uses a selection rule to determine which of the enabled actions is to be executed.
It is possible to use a selection rule based on random choice. A policy can be
pre-established that prescribes which action of a set of enabled actions is to be selected for
execution." & col. 12 line 66 ~ col. 13 line 6 "The enabled action selection rule
determines which of the enabled actions 620 and 622 is tried first. Similarly, if the action
620 does not update the storage location 632, but updates the storage location 630, the
selection rule determines whether the action 640 or the action 622 is used for recovery.
This behavior can be changed by the designer of the dependency action system by adding
additional dependencies.").

■ allowing a learning protocol comprised of learning instructions in said computer program 
(col.4 lines 45-50 "The method and system of this invention are derived from biological 
systems and organisms that achieve resilience and adaptability through redundant 
functional components, planning the use of these components to achieve a given task, and 
learning appropriateness of components from experience.") to track and reward said one
function that is selected and to determine an approximate optimal policy of choice of 
operation of said computer program based on said selection command (col. 8 lines 49-57 
"An enabled action selection rule is used to select one of several enabled actions for 
execution. Different selection rules can be used, including a trivial selection rule that 
picks one of the enabled actions at random. This invention specifies an enabled action 
preference policy that is described below with respect to FIG. 13.").

But Polak does not disclose 

■ each function from said list of pre-selected functions is associated with a reward.

However, Li explicitly discloses

■ wherein each function from said list of pre-selected functions is associated with a reward
(col. 16 lines 5-9 "the computer can also provide multiple choices of simple reward-risk 
choices from the highly efficient experiments for the manager to select to decide, e.g., 
whether to continue the next self-optimizing cycle. "). 

■ Therefore, it would have been obvious to a person of ordinary skill in the art at the time
the invention was made to modify the teaching of Polak with the teachings of Li so that
each function from said list of pre-selected functions is associated with a reward, in order
to provide a self-optimizing machine or method to generate computer code in real time
into the computer-generated knowledge bases.

Per claims 5, 17 

the rejection of claims 1 and 13 is incorporated, respectively, and Li further discloses

■ comprising inputting a rule command that establishes a rule for said computer program
on how to determine said approximate optimal choice of operation (col. 16 lines 55-57
"self-optimizing machine readily and automatically generates these and other similar
rules in computer-coded form ready for use as instant machine expert systems...").

Per claims 6, 18 

the rejection of claims 1 and 13 is incorporated, respectively, and Li further discloses

■ comprising inputting a reward command that provides a reward, at a point of choice, 
determined by a programmer, in said computer program, for said one function selected by 
said selection command, which results in said approximate optimal choice for self-optimizing
said computer program (col. 16 lines 5-9 "the computer can also provide
multiple choices of simple reward-risk choices from the highly efficient experiments for 
the manager to select to decide, e.g., whether to continue the next self-optimizing 
cycle. "). 

Per claim 7 
Polak discloses 

A method of optimizing a computer program, said method comprising: 

■ specifying at least one point of choice, determined by a programmer, in said computer 
program (col. 8 lines 11-15 "This invention preferably uses a specific type of selection 
rule, called a preference policy, that is based on a preference relation that specifies if 
one action is to be preferred over another action"). 

■ defining a set of alternate choices at each point of choice ([Abstract] "if one action fails 
to produce a value to a storage location, other alternative actions may still be enabled 
and can be executed"), wherein said set of alternate choices include operational choices 
comprises: 

■ inputting a selection command that selects one function from a list of pre-selected 
functions into said computer program (col.7 lines 54-57 "If multiple actions are enabled, 
the dependency action system uses a selection rule to determine which of the enabled 
actions is to be executed. It is possible to use a selection rule based on random choice") 

■ allowing a learning protocol comprised of learning instructions in said computer program 
(col. 4 lines 45-50 "The method and system of this invention are derived from biological
systems and organisms that achieve resilience and adaptability through redundant
functional components, planning the use of these components to achieve a given task, and
learning appropriateness of components from experience.") to track and reward said one
function that is selected to determine an approximate optimal operation of said computer 
program based on said selection command (col. 8 lines 49-57 "An enabled action 
selection rule is used to select one of several enabled actions for execution. Different 
selection rules can be used, including a trivial selection rule that picks one of the enabled 
actions at random. This invention specifies an enabled action preference policy that is 
described below with respect to FIG. 13"). 
But Polak does not disclose 

■ setting at least one feedback point for each point of choice and each function from said 
list of pre-selected functions is associated with a reward. 

However, Li discloses 

■ setting at least one feedback point for each point of choice (col.4 lines 46-52 "An 
additional object of the invention is to provide self optimizing machine and method which 
actively computes, and automatically sets at, the instantaneous optimal combinations of 
the many relevant variables in various categories, with instant feed-back to supply data
for immediate replanning, retesting, and reoptimizing").

■ wherein each function from said list of pre-selected functions is associated with a reward 
(col. 16 lines 5-9 "the computer can also provide multiple choices of simple reward-risk 
choices from the highly efficient experiments for the manager to select to decide, e.g., 
whether to continue the next self-optimizing cycle.").

■ Therefore, it would have been obvious to a person of ordinary skill in the art at the time
the invention was made to modify the teaching of Polak with the teachings of Li to include
the foregoing features in order to provide a self-optimizing machine or method to
generate computer code in real time into the computer-generated knowledge bases.

Per claim 8 

the rejection of claim 7 is incorporated and Polak further discloses 

comprising allowing a learning protocol in said computer program to determine an approximate
optimal operation of said computer program based on said specifying (col. 8 lines 11-12 "This
invention preferably uses a specific type of selection rule, called a preference policy"), defining
(col. 5 lines 59-64 "... The actions 20 define the functional components of the dependency action
system in terms of the data that is required and the data that is produced"), and setting (col. 13
lines 13-15 "The dependency action system control routine begins in step S1000. Control then
continues to steps 1100, where the dependency action system is initialized by setting all storage
locations to "empty"").

Per claim 11

the rejection of claim 8 is incorporated and Polak further discloses 
set of alternate choices include operational choices, further comprising: 

■ inputting a rule command into said computer program, wherein said rule command 
establishes a rule on how to determine said approximate optimal operation (col. 8 lines 
11-15 "This invention preferably uses a specific type of selection rule, called a preference
policy, that is based on a preference relation that specifies if one action is to be preferred
over another action"). 

Per claim 12 

the rejection of claim 8 is incorporated and Li further discloses 

set of alternate choices include operational choices, and wherein said method further comprises: 

■ inputting a reward command into said computer program at a point of choice, determined 
by a programmer, wherein said reward command provides reward in said computer 
program, which results in said approximate optimal choice for optimizing said computer 
program (col. 16 lines 5-9 "the computer can also provide multiple choices of simple 
reward-risk choices from the highly efficient experiments for the manager to select to 
decide, e.g., whether to continue the next self-optimizing cycle. "). 

6. Claims 19, 23-24 are rejected under 35 U.S.C. 103(a) as being unpatentable over Polak,
in view of Li and in further view of Christensen et al. (U.S. Patent No. 5,333,304 A1 hereinafter
"Christensen").
Per claim 19 
Polak discloses 

A computer system comprising: 

■ Processor adapted to execute a learning protocol in said computer program to track and 
reward said one function that is selected and determine an approximate optimal operation 
of said computer program based on at least said selection command (col. 8 lines 49-57
"An enabled action selection rule is used to select one of several enabled actions for
execution. Different selection rules can be used, including a trivial selection rule that 
picks one of the enabled actions at random. This invention specifies an enabled action 
preference policy that is described below with respect to FIG. 13") 

■ input a selection command at a point of choice, determined by a programmer, into a
computer program that runs a computer, said selection command selecting one function
from a list of pre-selected functions for input into said computer program (col. 7 lines 54-59
"If multiple actions are enabled, the dependency action system uses a selection rule to
determine which of the enabled actions is to be executed. It is possible to use a selection
rule based on random choice. A policy can be pre-established that prescribes which
action of a set of enabled actions is to be selected for execution." & col. 12 line 66 ~
col. 13 line 6 "The enabled action selection rule determines which of the enabled actions
620 and 622 is tried first. Similarly, if the action 620 does not update the storage
location 632, but updates the storage location 630, the selection rule determines whether
the action 640 or the action 622 is used for recovery. This behavior can be changed by the
designer of the dependency action system by adding additional dependencies.").
Li discloses 

■ wherein each function from said list of pre-selected functions is associated with a reward 
(col. 16 lines 5-9 "the computer can also provide multiple choices of simple reward-risk 
choices from the highly efficient experiments for the manager to select to decide, e.g., 
whether to continue the next self-optimizing cycle. "). 

But neither Polak nor Li discloses

■ a pre-compiler that inputs a selection command.

However, Christensen discloses

■ a pre-compiler that inputs a selection command (col.2 lines 16-26 "A software 
application to be evaluated is compiled utilizing a compiler application which inserts at 
least one prelude routine command during compiling which is utilized to invoke the 
prelude routine utilized upon each entry to a procedure within the software application. 
A command invoking the evaluation function is inserted into the prelude routine such that 
each entry to a procedure within the software application under evaluation will result in 
the storage of execution information at that point. "). 

■ Therefore, it would have been obvious to a person of ordinary skill in the art at the time
the invention was made to combine the teachings of Polak and Li and further include a
pre-compiler that inputs a selection command, as taught by Christensen, in order to
provide a method for evaluating software application performance utilizing a compiler
application ([abstract]).



Per claim 23 

the rejection of claim 19 is incorporated and Li further discloses

■ input a rule command into said computer program that runs on said computer, said rule 
command established a rule for said computer program on how to determine said 
approximate optimal choice of operation (col. 16 lines 55-57 "self-optimizing machine 
readily and automatically generates these and other similar rules in computer-coded 
form ready for use as instant machine expert systems...").

Per claim 24

the rejection of claim 19 is incorporated and Li further discloses

■ input a reward command that provides a reward, at a point of choice in said computer 
program which runs on said computer, for said one function selected by said selection 
command, which results in said approximate optimal choice for self-optimizing said
computer program (col. 16 lines 5-9 "the computer can also provide multiple choices of 
simple reward-risk choices from the highly efficient experiments for the manager to select 
to decide, e.g., whether to continue the next self-optimizing cycle. "). 

Response to Arguments 

Applicant's arguments filed on Dec. 23, 2008 have been fully considered but they are not
persuasive. 

■ In the remarks, Applicant argues that: 

(a) Regarding claims 1, 7, 13 and 19, Polak fails to disclose, teach or suggest "inputting a selection
command that selects one function from a list of pre-selected functions for input into said
computer program at a point of choice determined by a programmer" and "allowing a learning 
protocol comprised of learning instructions". 

(b) Li fails to remedy the deficiency of Polak with respect to "each function from said list of
pre-selected functions is associated with a reward".

Examiner's response:

(a) The Examiner has cited different passages of Polak to clarify those features in the independent
claims. Polak discloses inputting a selection command that selects one function from a list of
pre-selected functions for input into said computer program at a point of choice determined by a
programmer (Col. 7 lines 54-59 "If multiple actions are enabled, the dependency action system 
uses a selection rule to determine which of the enabled actions is to be executed. It is possible to 
use a selection rule based on random choice. A policy can be pre-established that prescribes 
which action of a set of enabled actions is to be selected for execution. " & col. 12 line 66 ~ 
col. 13 line 6 "The enabled action selection rule determines which of the enabled actions 620 and 
622 is tried first. Similarly, if the action 620 does not update the storage location 632, but 
updates the storage location 630, the selection rule determines whether the action 640 or the 
action 622 is used for recovery. This behavior can be changed by the designer of the dependency
action system by adding additional dependencies. "). Also, Polak further discloses allowing a 
learning protocol comprised of learning instructions in said computer program. In the 
background of Polak's invention (col. 1 lines 8-11 "This invention is directed to methods and
systems for building and executing complex software. In particular, this invention relates to
self-optimizing software employing learning, and planning techniques." & col. 4 lines 45-50 "The
method and system of this invention are derived from biological systems and organisms that 
achieve resilience and adaptability through redundant functional components, planning the use 
of these components to achieve a given task, and learning appropriateness of components from 
experience."). The learning by software or components described there implicitly comprises learning
instructions.

(b) Li discloses "each function from said list of pre-selected functions is associated with a
reward" (col. 16 lines 5-9 "the computer can also provide multiple choices of simple reward-risk 
choices from the highly efficient experiments for the manager to select to decide, e.g., whether to 
continue the next self-optimizing cycle.").

Conclusion 

7. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37
CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event,
however, will the statutory period for reply expire later than SIX MONTHS from the mailing 
date of this final action. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Junchun Wu whose telephone number is 571-270-1250. The 
examiner can normally be reached on 8:00-17:00 M-F.

If attempts to reach the examiner by telephone are unsuccessful, the examiner's
supervisor, Wei Zhen can be reached on 571-272-3708. The fax phone number for the
organization where this application or proceeding is assigned is 571-273-8300.

Information regarding the status of an application may be obtained from the Patent Application
Information Retrieval (PAIR) system. Status information for published applications may be 
obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 



JW 

/Wei Y Zhen/ 

Supervisory Patent Examiner, Art Unit 2191 



